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1 Introduction 

Analyzing word complexity usually involves studying the fragments of a word or the 
process of its construction (see [2] for instance). Arnold introduced [1] a new concept of 
complexity of a word. The measure of this complexity is determined by the "stability" 
of a word under the iterated action of a certain operator. 

Consider an arbitrary nonperiodic binary word w = xiX2---X2 n , with w 7^ v k 
and k > 2, of length \w\ = 2 n for n > 1. Denote by (w) the infinite periodic word 
(w) — ww.... Henceforth, by a "word (w)" we understand "an infinite periodic 
word (w)" . Consider the scheme (word chain) 



in which the first word is arbitrary, and every word (wi) = t/iJ/2 ■ • ■ generates the next 
word (wi+i) = Z1Z2 . . ., for 1 < i < s — 1, using the operator 



where j > 1, 1 < hi — 2 n *, < nj < n, and ® stands for modulo 2 addition; thus, 
F(wi,hi) — (wi+i). The number hi is called the rank of the operator in (2), and the 
number s, the length of the scheme (1). 

Denote by S(w, v, h, s) the type of schemes with the first word (w), the last word 
(v), the maximal rank h of operators involved, and the scheme length s. For every (w) 



(w) = (wi), (w 2 ), ... , (w e ) = (v), 



(1) 



F(-, hi) : (w x ) h-> (w i+1 ) : z 3 = yj ® y j+hi , 



(2) 



there exists a minimal s such that all words in the scheme S(w, 0, 1, s) are distinct, 
and F(v, 1) = (0). A scheme of this type is called a complexity scheme. Every word 
(w) has a unique complexity scheme. The number s — 1 is called the complexity of 
the binary word (to) and is denoted by A(w). Arnold introduced [1] the concept of 
complexity of a binary word in a more general form, which coincides with our definition 
of complexity when the word length equals 2 n . The complexity of a periodic word (w) 
is equal to the complexity of the finite word w. 

In an arbitrary scheme S(w,v, 1, s) select a word chain 

(w) = (wi), Oi+jJ, . . . , (w 1+h+ ...+i t ) = (v), (3) 

where U > 1 for 1 < i < t < s — 1. If in (3) each word (wi+i 1 +...+i i ) for 1 < i < s — 1, 
coincides with F(wi+i 1 +...+i i _ 1 , hi), hi = U, then the word chain in (3) is a scheme of 
type S(w, v, h, St) with St = 1 + t, which is called equivalent to S(w, v, 1, s). 
In [3] we proved 

Theorem 1.1 Every scheme S(w,v,l,2 n + 1) with n > is equivalent to the 
elementary scheme S(w, v, 2™, 2). 

In a word w — x\X2 ■ ■ ■ X2™, n>l, select 2™ _m positions, with < m < n — 1, such 
that the distances between two neighboring selected positions are the same and equal 
to 2 m . Using the selected positions, form the word u = XiX i+ 2™ . . . x i+ 2 n -2 m of length 
2>i-m Tj eno t e t ne i nnn ite word (u) = uu . . . by (w)^* _ m and call it a thinned-out word. 
The number 2™ _m is called the step of the thinned-out word (w)^_ m . Observe that 
every thinned-out word is a linearly ordered set of indices of positions of w. The length 
of the period of (w)^_ m can be less than 2™ _m . For m = we have (w) — (w)^- 

Given a word (w), for a fixed value of m there exist 2 m different thinned-out words 

Hai-m > M£- ra , ■ ■ ■ , , n > 1, < m < n - 1. (4) 

For instance, for n = 3 we have (w)^ 1 = (a;iX2a;3a;4a;5a;6a;7X8), m = 0; 
(10)23-1 = (xix 3 x 5 x 7 ), (w)^-! = (a;2a;4a;6a:8), m = 1; 
M23-2 = (^1^5), (™)23 2 -2 = (a^ze), (w)2 3 3 -2 = (2:33:7), 

(w)a3-2 = (X4X 8 ), m = 2. 
Define the operation of taking the union of thinned-out words, denoted by the 
symbol *. The definition of a thinned-out word implies that each of the positions 
xi, X2, ■ ■ ■ , a?2" appears in the thinned-out words (4) exactly once since it is the union 
of arithmetic progressions with differences equal to powers of 2. Thus, we can express 
(w) as 

(w) = (w) X 2 l_ m * (w) X 2 ^_ m * ... * (w)*n™m n > ^> < 771 < n — 1. 

We group the words into two sorts: even words and odd words. The word (w) = 
ww . . ., where w — X\X2 ■ ■ ■ £2™ with Xi € {0, 1} and 1 < i < 2™ for n > 0, is called 
even whenever x\ © X2 © . . . © X2™ = 0, and odd whenever x\ © X2 © . . . © £2™ = 1. For 
calculating the parity of the thinned-out words {w) Xj n _ m , where |w| = 2" with n > 1 
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and < m < n — 1, of a word (X1X2 ■ ■ ■ %2 n ), we gave a simple algorithm [4], which 
uses modulo 2 addition 2 n — 1 times, and proved 

Theoreml.2 For every binary word (w) the length of whose period is equal to 2 n , 
n > 1, all thinned-out words {w)^ n _ m for < m < n — 1 and 1 < i < 2 m are odd i/ 
and only if the complexity of (w) is equal to A(w) — 2 n — 2 m + 1. 

Express the complexity A(w) of an arbitrary word (w) with |w| = 2™ for n > 1 as 

A(ti;) = a™-^"- 1 ' + a„_ 2 2 ( ™- 1) - 1 + . . . + a 2 (n - 1) - ( ™- 1) , (5) 

or the binary number a n _ia n _2 . . . ao, where a; 6 {0, 1} for < i < n — 1. 

Express the complexities A(w), which according to Theorem 1.2 we calculate by 
finding the parities of thinned-out words, as the binary numbers 

2" - 2 1 + 1 = o„_io„_2 . . .oo = 11 . . . Ill, 

2 n -2 2 + l = o„_io„_ 2 ...oo = 11...101, ... , (6) 

2 n -2"~ 1 + l = o„_io„_ 2 . . . oo = 10 . . . 001, 

2 n_ 2 n-i +0 = an _ ian _ 2 _ _ _ ao = 10... 000, n > 1, 

where all numbers are odd with the exception of 2" — 2 n ~ 1 . 

Refer to a word (v) as final if A («) equals one of the values in (6) . Every complexity 
scheme S(w, 0, 1, s) with s = 2™ for n > 1 contains n + (n — 1) + . . . + 1 final words. 

Using t > 1 operators (2) of ranks fti, h2, ■ ■ ., h t , transform the complexity scheme 
S(w, 0, 1, s) into a scheme 

(to) = (wi), (w 1+hl ), (w 1+hl+ ... +ht ) = (u), {w 1+hl + ... +ht+1 ), . . . , (0) 

with the final word (v). Then 

A(w) = hi + hi + . . . + ftt + v4(v), (7) 

where A(v) is one of the numbers (6). It is obvious that in order to transform (w) 
into («), every permutation of the ranks hi, /i2,---, h t of the operators F(-,hi) for 
1 < i < t is admissible. 



2 Shannon Function 

Refer to the minimal number of operators F(-, hi) required to transform (w) into one 
of the final words (v) as the complexity of transformation of (w) into (v), and denote 
it by A(w,v) — min A(w). 

Our goal is to find the Shannon function max min A(w), which we denote by Sh(n). 

w w — >v 

Consider an example. Take a complexity scheme (wis), (wis), • • • , (™2), (wi), (wo). 
For the words (w) with jw| = 2™ for < n < 4 five values of complexity exist, for 
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which we have expressions as in (7). In each of these cases an operator F(wi,2 r ) : 
(wi) h-> (wj) is used only once. Table 1 presents the results of calculating A(w) for all 
words (w) with |«;| = 2" for 1 < n < 4. 

Table 1 



(Wi) 


A(Wi) 


^ (w i; 2 J . (Wi) ^ (w,J 




(wie) 


2 4 — 2 U + 1 






(Wis) 


2 4 - 2 1 + 1 






(W14) 




F(wi4,2 u ) : (wi 4 ) 1 ^ (wis) 


A(tui4) = 2 U + (2 4 - 2' + 1) 


(wi 3 ) 


2 4 - 2^ + 1 






(W12) 




F(wi2, 2^) : (wi 2 ) 1 y (wa) 


A(wi 2 ) = 2' + (2 A - 2 U + 1) 


(wn) 




F(wu,2 L ) : (wn) 1 ^ (w 9 ) 


A(wn) = 2 1 + (2* - 2 J + 1) 


(wio) 




F(wi ,2 u ) : (wio) h-> (w 9 ) 


A(w 10 ) = 2" + (2 4 - 2 A + 1) 


(^9) 


2 4 - 2 3 + 1 






HO 


2 a - 2 ,J + 1 






(w?) 


2 ;i - 2 1 + 1 






(w 6 ) 




F(w 6 ,2 u ) : (w 6 ) h-> (w 5 ) 


A(w 6 ) = 2 U + (2 3 - 2* + 1) 


(w 5 ) 


2 ;l -2^ + 1 






(w 4 ) 


2^ - 2 U + 1 






(W3) 


2 2 - 2 1 + 1 






(w 2 ) 


2 1 - 2 U + 1 






Oi) 


2" 







In the next theorem we consider the general case for n > 5. 
Theorem 2.1 Given a word (w) with \w\ = 2 n for n > 5, we have 

1 < / L n — Sy'n + lj ui/ien t/ie binary number A(w) is odd, 
\ [n — 2^/n + 2J otherwise. 

Proof. Case 1. Assume that the value of the complexity A(w) is odd. 

Fix A(w) and estimate the minimal number of operators transforming (w) into a 
final word (v). It is obvious that in this case the ranks of all operators are distinct. 

In order to estimate A(w, v), consider the result of the action of the operator of (2) 
on the coefficients a„_i, a„_2, . . . , ao of (5). if 

F(u u h = 2 {n - 1) - 1 ) = (u 2 ), 0<i<n-l, 

then A(m) — A(u 2 ) = 2 < - n ~ 1 ^~ l . Moreover, two variants are possible for changing the 
values of a n -i, a„_2, . . . , ao'. 

(o ( „_i)_i = 1) h> (o ( „_i)_i = 0); (8) 
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(«(n-l)-i+j = 1) ^ ( a (n-l)-i+j = 0), 
(ffl(n-l)-i + (j-l) = 0) l-> (0(„_l)_ i+ (j_l) = 1), ... , (9) 
(<l(n-l)-i =0) ' ^ (0(„_i)_i = 1), 

where 1 < i < n — 1, 1 < J < i. 

In case (8) the rank of h — 2( n ~ 1 >~ 1 coincides with one of the terms in the sum (5). 
The action of the operator removes the term 2^ n_1 ' _l from (5). For instance, the 
operator of rank h = 2 1 transforms A(w) = 2 4 + 2 3 + 2 1 + 2° into 2 4 + 2 3 + 2°. 

In case (9), when the rank of h = 2' n ~ 1 '~ ! is distinct from all terms of (5), we 
remove the term 2' n ~ 1 '~' +; ' with minimal j. Simultaneously, we add to (5) the terms 

2(n-i)-<+0'-i) 2( n ~ 1 )~ i +0 -2 ) 2 < - n ~ 1 '>~ i (io) 

For instance, the operator of rank h = 2 1 transforms A(w) = 2 4 + 2 2 + 2° into 

2 4 + 2 1 +2°. 

Consider the case when we can apply (9) in order to calculate A(w,v). 

Suppose that (5) includes a run o» = dj-i = ... = a^ l+ i = 1 of neighboring 
unit coefficients of maximal length, where i < n — 1 and i + I — 1 > 1, which we 
denote by s(i,l). Several runs of maximal length may exist; for instance, A(w) — 

2 5 + 2 4 + 2 2 + 2 1 + 2° includes two such runs: s(5, 2) and s(2, 2). 

For a fixed value A(w) of complexity choose a run s(i, I) arbitrarily. If A(w) is 
distinct from (6) then the sum in (5), in addition to the I terms 2 l ,2'~ 1 , . . . ; 2 ! ~ i+1 
and the term 2°, also involves t distinct terms with 1 < t < n — I — 2. Once we remove 
these t terms, the remaining sum would coincide with one of the sums in (6). 

To remove t distinct terms from (5) using (8) we need t operators F(-,hi), of 
distinct ranks hi, hi, . . ., h t . For instance, in A(w) — 2 5 + 2 4 + 2 2 + 2 1 + 2° choose a 
run of neighboring unit coefficients of maximal length s(5, 2) and remove the terms 2 2 
and 2 1 . This yields the sum 2 5 + 2 4 + 2°, which coincides with one of the sums in (6). 

The transformation process A(w) i-¥ A(v) involves a unique case when the replace- 
ment of the variant (8) by the variant (9), in which the number of terms increases, 
fails to increase the number F(-, hi) of operators in the transformation A(w) <-¥ A(v). 
Moreover, the form of the final word changes: it additionally includes all terms of (10). 
This happens when in A(w) — 2 3 + 2 1 + . . . + 2° with j > i + 2 we choose a run s(i, I) 
and apply the operator F(w,h — 2 !+1 ). Then we remove from A(w) the term 2 J 
and transform the run s(i,l) into the run s(j — 1, 1 + (j — i — 1)). For instance, for 
A(w) = 2 5 + 2 3 + 2 2 + 2° choose the run s(3,2). Then the operator F(w,h = 2 4 ) 
transforms A{w) = 2 5 + 2 3 + 2 2 + 2° into 2 4 + 2 3 + 2 2 + 2°, while the run s(3, 2) goes 
into s(4, 3). 

Consequently, for removing t distinct terms from (5) the application of (9) is not 
necessary, and for finding A(w,v) we may use only the operators resulting in (8). 

Remark Every odd binary number A(w) includes the term 2°, which we do not 
remove while constructing A(w,v). Consequently, the operator of rank h = 1 is not 
used while obtaining A(w,v). 
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Denote by v(N) the number of l's in the binary expression for a nonnegative 
integer N. The arguments above imply that 

A(w, v) = min A(w) = v(A(w)) -2-1, 

w—tv 

where I is the length of the maximal run s(i, I). 

Let us find Sh(n) for nonfinal words (w) with \w\ — 2 n for n > 1. 

Construct a continuous function which, copying the process of removal of the 
maximal number of l's from a binary number A(w), determines an upper bound for 
Sh(n). 

Divide a line of integer length n > 4 into x segments, with 2 < x < n/ 2, of the 
same length n/x. Keeping one of the segments intact, remove the beginning of all 
other segments to leave only a finite part of unit length. Then the total length of the 
removed segments is estimated by the convex function 

f(x) = (x-l)(n/x-l), 

which has one extremum. Find the derivative f'(x) and set it equal to zero: 

f'(x) = n/x 2 -1 = 0. 

This yields x = ^fn and the maximal value attained by the function f(x), equal to 

f(x = y'n) — n — 2\pn + 1. 

For odd binary numbers A(w) Theorem is proved. 

Case 2. Assume that the value of the complexity A(w) is even. Estimate the 
minimal number of operators (2) required for calculating A(w, v). 

Suppose that the length n binary number A(w) includes u(A(w)) digits 1, where 
2 < v(A(w)) < n - 1, and 

«(n-i)-j = 1, a-(n-i)-j-i = 0, . . . , a = 0, 1 < j < n - 2. (11) 
Two variants for calculating A(w, v) are possible. 

Subcase 2.1. From the binary number A(w) = a n _ia„_2 . . . ao, which contains 
u(A(w)) digits 1, remove u(A(w)) — 1 digits 1 using (8). This yields a binary number 
A(v) with a unique digit 1. The number of operators A(w) t-¥ A(v) equals 

v(A(w))-l. (12) 

Subcase 2.2. Apply the operator F(w = wi, h = 1): (wi) t-¥ (w'z) once. As a result, 
the even number A(wi) goes into the odd number A(u>2), and A(w2) = A(w\) — 
1. Carry out further calculations according to the algorithm of case 1, in which by 
Remark 2.2 the operator (2) of rank 1 is not used. 
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Upon the application of F(w = wi,h = 1) to the number A(w) all binary digits 
in (11) switch their values in accordance with (9). Therefore, the number A(w 2 ) 
includes 

v{A{w 2 )) = v(A{w)) - 1+j 

digits 1. Removing from A(w 2 ) all digits 1 except for I of those in s(i, I) and ao = 1, 
we obtain A(v) with u(A(v)) = 1 + 1 digits 1. The number of operators transforming 
A(w) into A(v) equals 

v{A{w 2 )) - v(A(v)) + 1 = v(A(w)) (13) 

In order to estimate the complexity 

A(w,v) = min A(w) 

we choose the variant with the minimal number of operators. A comparison of (12) 
and (13) shows that this number occurs in subcase 2.1 for j > I and in subcase 2.2 for 
3<l- 

For instance, for A(w) = 101110100 we choose subcase 2.2: 

A(w) = 101110100, v{A{w)) = 5, j = 2; 

A(w 2 ) = 101110011, v{A{w)) - 1 + j = 6, 1 = 3; 

A(v) = 001110001, v{A(v)) = I + 1 = 4; 

A(w) ^ A(v) => 1 11 , v(A(w)) +j -1-1 = 3. 

Observe that if the operator F(w = wi,h = 1) : (wi) n> (w 2 ) in subcase 2.2 
generates an odd word (w 2 ), for which we have already established the estimate [n — 
+ lj, then for even A(wi) we have the estimate [n — + 2J since A(wi) = 
A(w 2 ) + 1. 
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